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Supervisory  ratings  are  the  most  common  method  of  performance 
evaluation.  In  using  such  ratings,  one  employee  (usually  a supervisor) 
is  required  to  rate  another  employee  on  several  traits  or  performance 
dimensions.  The  pitfalls  inherent  in  the  use  of  such  criteria  are 
well  known.  However,  relatively  little  research  has  been  done  on  racial 
bias  as  a source  of  contamination  in  performance  rat i ngs.^The  primary 
purpose  of  this  research  was  to  assess  the  degree  to  which  rating 
behavior  is  affected  by  the  race  of  rater  and  ratee. 

Three  primary  hypotheses  were  tested:  (1)  Evaluations  of  job  perform-; 

ance  are  related  to  the  race  of  rater  and  ratee;  (2)  In  the  absence  of 
relevant  information  perceived  attitude  similarity  is  related  to  the  race 
of  the  individuals  involved;  and  (3)  The  effects  of  racial  bias  on  pei — 
formance  ratings  will  depend  upon  the  cognitive  balance  between  rater, 
ratee,  and  the  ratee' s performance. 

Eight  hypothetical  college  instructors  were  evaluated  by  17*+  white 
and  129  black  college  students.  Each  instructor's  teaching  performance  was 
described  using  statements  of  predetermined  favorab i 1 i ty . Three  levels  of 
teaching  performance,  good,  average,  and  poor  were  described-.  The  race  of 
the  instructors  was  manipulated  through  the  use  of  photographs.  Each  sub- 
ject evaluated  every  instructor  in  terms  of  (a)  quality  of  performance, 

(b)  promotab i 1 i ty , (c)  agreement  with  his  teaching  philosophy,  (d)agreement 
with  his  philosophy  about  students  and  (e)  expected  change  in  future 
performance. 

The  hypotheses  were  tested  using  correlational  analyses.  Race  of 
rater  and  ratee  was  not  related  to  the  performance  evaluations  of  good 
instructors.  Where  teaching  performance  was  described  as  poor,  white 
raters  favored  white  instructors  over  black  instructors  in  terms  of  quality 
of  performance.  Where  teaching  performance  was  described  as  average,  white 
raters  favored  average  black  instructors  over  white  instructors  in  terms  of 
promotab i 1 i ty.  Black  raters  favored  average  black  instructors  over  average 
white  instructors  in  terms  of  quality  of  performance  and  promotab i 1 i ty . It 
was  deduced  that  performance  ratings  were  based  primarily  on  teaching 
performance,  but,  when  teaching  performance  was  not  exceptional,  rater- 
ratee  racial  similarity  was  a factor  as  well. 

When  teaching  performance  was  described  as  very  good,  there  was  no 
significant  relationship  between  rater-ratee  racial  similarity  and  agree- 
ment with  the  ratee's  beliefs.  However,  white  raters  indicated  that  they 
would  agree  more  with  the  beliefs  of  a poor  white  instructor  than  a poor 
black  instructor.  Black  raters  indicated  that  they  would  agree  more  with 
the  beliefs  of  an  average  black  instructor  than  an  average  white  instructor 
It  was  concluded  that  perceived  belief  similarity  was  Dased  more  on  teach- 
ing performance  than  race,  but,  when  teaching  performance  was  not  excep- 
tional, rater-ratee  racial  similarity  was  a factor.  Manipulation  of  the 
independent  variable  was  not  sufficient  to  determine  the  effects  of 
cognitive  balance  on  rating  behavior. 
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CHAPTER  I 


HJERODUCTION 

Job  performance  criteria  are  rules  or  standards  by  vhlch  work  per- 
formance can  be  Judged.  Etamples  of  auch  criteria  lnctade,  quality  of 
vrork,  quantity  of  cork,  absenteeism,  promotablllty,  and  supervisors' 
ratings.  Mhen  appropriately  developed,  and  applied,  criterion  measures 
are  useful  lnllces  of  Job  performance.  Unfortunately  moat  Job  perform- 
ance  criteria  are  poorly  developed  and  Inequitably  applied.  As  a conse- 
quence, selection  and  training  programs  often  contain  tbe  same  faults 
as  the  criteria  on  which  they  were  developed. 

Criterion  Contamination 

Although  most  researchers  recognize  that  contaminated  criterion 
measures  can  significantly  affect  predicts,  validity,  the  problem  has 
been  virtually  ignored.  This  is  especially  true  vhere  employment  dis- 
crimination is  concerned.  Most  of  the  research  in  this  area  has  center- 
ed around  the  question  of  differential  predictability  of  minorities  end 
non-minorities,  (Bartlett  & O'Leary,  1969;  Bray  A Moses,  1972;  Cleary, 

1968;  Parr,  O’Leary  & Bartlett,  19T1;  Kirkpatrick,  et  al.,  1968; 

Lefkovitt,  1972;  lopez,  1966).  In  this  research  little  attention  ves 

given  to  criterion  contaminations 

Criterion  contamination  can  be  defined  as  the  degree  to  which  the 
oeasures  actually  used  to  determine  job  success  (e.g.  supervisory  ratings) 
contain  elements  that  are  unrelated  to  true  measures  of  job  success 
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(Blum  & Baylor , 1968).  One  of  the  components  of  criterion  contamination 
Is  systematic  error  or  criterion  bias.  Criterion  bias  refers  to  those 
components  of  an  actual  criterion  which  are,  (a)  unrelated  to  Job  suc- 
cess and,  (b)  correlated  with  one  or  more  variables  such  as  rater  len- 
iency or  opportunity  bias. 

Prior  to  focusing  directly  on  the  issue  of  racial  bias  as  a contam- 
inant of  criterion  measures,  it  may  be  useful  to  consider  ways  in  which 
criterion  contamination  has  been  conceptualized  in  the  past.  Kipnis 
(i960),  in  discussing  the  lack  of  attention  given  to  bias  in  criterion 
ratings,  outlined  some  of  the  factors  which  may  contribute  to  it.  He 
proposed  that  a supervisor  may  systematically  err  in  rating  a subordinate 
by  rating  too  leniently,  too  harshly,  or  by  allowing  a few  performance 
dimensions  to  unduly  affect  evaluation  of  other  performance  dimensions. 

Leniency  in  Ratings 

Some  of  the  factors  which  seem  to  influence  rater  leniency  are 
propinquity  between  rater  and  ratee;  the  amount  of  pressure  under  which 
the  supervisor  works;  the  degree  to  which  the  supervisor  is  allowed  to 
criticize  a subordinate  during  the  course  of  their  interactions;  and 
the  type  of  rating  scale  employed. 

Propinquity.  Physical  distance  between  individuals,  such  as  em- 
ployees, dictates  whether  they  develop  friendships  (Byrne  & Buehler, 

1955;  Caplow  & Forman,  1950;  Festinger,  Schacter  & Bach,  1950;  Kipnis, 
1957).  Thi3  suggests  that  the  closer,  in  physical  terms,  an  employee 
is  to  his  or  her  supervisor,  the  greater  the  likelihood  of  their  be- 
coming friends.  This  may  in  turn  affect  the  employee's  performance 
rating.  In  the  industrial  setting,  factors  such  as  the  physical  ar- 
rangement of  work  space  or  the  assignment  of  work  tasks,  may  cause 
leniency  in  performance  ratings. 
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Social  setting.  Another  factor  which  affects  rater  leniency  is  the 
social  setting  within  which  performance  evaluations  are  made.  Changes 
in  interpersonal  structure  within  the  work  setting  can  produce  changes 
in  the  degree  to  which  group  members  like  each  other.  Such  changes 
could  influence  the  performance  evaluations  of  the  employees  involved. 

Pressure.  The  degree  of  pressure  a supervisor  receives  from  the 
social  setting  may  affect  his  or  her  ratings  of  subordinates.  Research 
by  Murray  (1933)  and  Feshbach  and  Singer  (1957)  indicates  that  stress 
brought  about  by  pressure  will  decrease  the  level  of  positive  sentiment 
between  individuals.  The  greater  the  stress,  the  more  likely  the  rater 
is  to  project  maliciousness  and  aggression  onto  the  person  being  rated. 
Rigidly  enforced  personnel  policies  or  outside  pressures  such  as  EEOC 
guidelines  could  produce  stressful  pressure  resulting  in  systematic  bias 
in  criterion  ratings. 

Criticism.  Research  on  rater  leniency  also  indicates  that  individ- 
uals who  are  restrained  from  criticizing  others  tend  to  develop  less 
friendly  feelings  toward  them  (Thibaut  & Coules,  1952;  Pepitone  4 
Kleiner,  1957).  Spector  (195*0  found  that  blocking  rater  criticism  of 
a subordinate' 3 performance  tended  to  result  in  less  lenient  evalua- 
tions. He  found  that  the  most  lenient  ratings  were  given  by  supervisors 
who  were  allowed  to  criticize  the  ratee. 

The  rating  scale.  The  rating  scale  itself  can  influence  the  degree 
to  which  leniency  occurs.  It  has  been  well  documented  that  rating 
scales  differ  in  the  degree  to  which  they  minimize  rater  leniency 
(Campbell,  Dunnette,  Arvey  4 Hellervik,  1973;  Madden  4 Bourdon,  196k; 
Sharon  & Bartlett,  1969)*  Madden  and  Bourdon  (196*+)  constructed  nine 
variations  of  mean  occupation  evaluation  ratings.  The  results  showed 
that  different  rating  scale  formats  produced  significant  differences 
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In  evaluations.  Campbell  et  al.  (1973)  developed  behavioral ly  anchored 
rating  scales  for  managerial  performance.  The  behaviorally  anchored 
scales  were  compared  with  traditional  Likert  type  scales.  The  results 
showed  that  the  Likert  type  scales  were  more  susceptible  to  leniency 
error.  Sharon  and  Bartlett  (1969)  assessed  the  degree  to  which  a forced 
choice  rating  scale  minimized  leniency  error  in  comparison  with  a graphic 
rating  scale.  Leniency  was  defined  as  a favorable  shift  in  mean  ratings 
from  one  rating  condition  to  another.  They  found  that  the  forced  choice 
scale  yielded  significantly  leas  leniency  error. 

Halo 

There  is  evidence  which  indicates  that  a limited  class  of  behaviors 
can  influence  ratings  of  other  aspects  of  job  performance.  That  is,,  a 
particular  trait  can  influence  the  evaluation  of  other  aspects  of  Job 
performance.  This  may  explain  differences  often  found  between  super- 
visory ratings  and  less  judgemental  measures  of  ratee  performance. 

The  research  literature  indicates  that  performance  ratings  contain 
some  elements  of  bias.  This  bias  has  often  been  conceptualized  as  a 
tendency  on  the  part  cf  raters  to  allow  attitudes  toward  a subordinate 
to  influence  evaluations  of  the  subordinate's  Job  performance.  Little 
research  has  been  done  however  to  determine  whether  characteristics  of 
the  ratee  such  as  ethnic  group  membership  or  the  ratee 's  sex  are  related 
to  evaluations  of  Job  performance. 

Ethnic  Group  Membership  as  a Contaminant  of  Performance  Ratings 

In  a study  more  directly  relevant  to  the  topic  of  this  thesis, 

Bass  and  Turner  (1973)  studied  the  significance  of  ethnic  group  member- 
ship in  relationships  among  criteria  of  Job  performance.  Their  purpose 
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was  to  assess  the  degree  of  racial  discrimination  or  systematic  bias  in 
criterion  measures  of  black  and  white  bank  tellers.  This  particular 
sample  provided  the  opportunity  to  study  both  non- Judgemental  and  rating- 
type  measures  of  Job  performance.  Two  hundred  and  forty-four  part-time 
and  190  full-time  tellers  were  studied.  Thirty  two  of  the  part-time 
tellers  and  forty-three  full-time  tellers  were  black.  All  of  the  tellers 
performed  basically  the  same  work  tasks.  The  performance  evaluations 
were  obtained  from  163  supervisors.  Performance  was  rated  on  the  basis 
of  five  Job  performance  factors  and  overall  effectiveness.  The  five 
factors  were  (l)  customer  relations,  (2)  ability  to  sell  new  accounts 
and  services,  (3)  quality  of  work,  (U)  alertness  to  irregularities,  and 
(5)  cooperation  with  others.  Each  of  the  five  factors  was  evaluated 
using  a five  point  scale.  Overall  effectiveness  was  rated  on  a ten  point 
scale.  Approximately  Qh'ft  of  the  employees  were  rated  by  two  supervisors, 
allowing  inter-rater  reliabilities  to  be  obtained.  The  median  reliabil- 
ity for  the  six  scales  (five  factors  plus  overall  effectiveness)  was  .76. 
In  addition  to  supervisory  ratings,  four  non- Judgemental  measures  of  Job 
performance  were  obtained.  These  were  (a)  number  of  shortages  obtained 
over  a one  month  period,  (b)  number  of  overages  obtained  for  the  seme 
period,  (c)  attendance  (computed  only  for  full-time  tellers)  and,  (d) 
net  salary  increase  for  each  teller  since  first  coming  to  work  (i.e. 
salary  increase  corrected  for  tenure).  Measures  of  three  control  var- 
iables, tenure,  age  and  race  were  also  obtained.  The  results  showed 
that  white  employees,  both  full-time  and  part-time,  had  higher  mean 
scores  on  practically  all  of  the  criterion  variables.  Mean  differences 
were  significant  with  regard  to  customer  relations,  quality  of  work, 
alert ne 33  to  irregularities,  cooperation,  overall  effectiveness,  number 
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of  shortages,  and  adjusted  salary  Increase.  White  employees  were  also 
older  and  more  experienced.  However,  this  does  not  seem  to  account  for 
the  differences  In  criterion  ratings. 

Since  there  were  definite  disparities  between  the  mean  ratings  of 
the  two  groups,  a question  arose  as  to  whether  the  race  of  the  rater  or 
ratee  influenced  rating  behavior.  Intercorrelations  were  computed  be- 
tween all  of  the  variables.  Race  was  significantly  correlated  with  both 
supervisory  ratings  and  objective' measures  of  Job  performance.  Correla- 
tions were  also  computed  between  race  and  the  criterion  measures  with 
tenure  and  age  partialled  out.  It  should  be  recalled  that  the  mean  age 
and  Job  experience  of  the  white  employees  were  higher  them  that  of 
blacks.  This  might  have  lead  to  the  conclusion  that  the  relationships 
between  race  and  the  criterion  variables  were  simply  artifacts  of  dif- 
ferential Job  experience.  The  partial  correlations  indicated  that  this 
was  not  the  case.  There  were  four  significant  correlations  before  age 
and  tenure  were  statistically  controlled.  After  these  variables  had 
been  partialled  out,  two  of  the  four  relatic;.  'ps  (percent  of  the  time 
worked  and  number  of  shortages)  remained  significant.  In  the  case  of 
part-time  employees,  four  of  the  original  seven  correlations  (quality 
of  work,  overall  effectiveness,  number  of  shortages  and  adjusted  salary 
increase)  were  significant  after  age  and  tenure  were  partialled  out. 
Control  for  the  effects  of  age  and  tenure  reduced  but  did  not  eliminate 
the  relationship  between  race  and  measures  of  Job  performance. 

The  fact  that  blacks  received  significantly  lower  performance 
ratings  than  whites  does  not  necessarily  mean  that  they  were  discriminat 
ed  against.  It  is  possible  that  the  black  employees  as  a group  were 
poorer  performers  than  white  employees.  In  any  event,  the  major  conclu- 
sion drawn  by  Baas  and  Turner  was  that  the  supervisors  tended  to  use 
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less  Judgemental  data  In  evaluating  black  tellers  but  did  not  do  so  for 
whites.  Their  conclusion  was  based  upon  the  finding  that  supervisory 
ratings  of  black  employees  were  more  consistent  with  less  Judgemental 
indices  of  Job  performance  (e.g.  number  of  shortages  and  overages)  than 
ratings  of  white  employees.  Another  explanation  for  Bass  and  Turner's 
results  is  that  the  supervisors  (all  white)  felt  psychologically  dis- 
tant from  black  employees. 

Regardless  of  whether  there  were  differences  in  the  factors  used 
to  evaluate  the  tellers  or  differences  in  psychological  distance,  the 
results  of  Bass  and  Turner's  study  indicate  that  it  would  be  difficult 
to  develop  predictors  of  the  bank  tellers’  performance.  If  supervisory 
ratings  were  used  as  the  criterion,  a given  predictor  would  probably 
not  show  the  3ame  degree  of  correlation  with  the  performance  ratings  of 
blacks  as  it  would  with  ratings  of  whites.  The  supervisors  apparently 
used  different  standards  for  evaluating  the  two  groups.  If  as  Bass  and 
Turner  report,  non- Judgemental  data  (i.e.  shortages  and  overages)  was 
used  more  in  evaluating  black  employees,  a measure  of  numerical  ability 
might  show  the  highest  degree  of  correlation  with  their  performance  rat- 
ings. Such  a predictor  would  probably  show  little  correlation  with  per- 
formance ratings  of  white  employees  since  use  of  objective  data  was  not 
emphasized  in  evaluating  them.  Personality  or  motivational  factors  were 
apparently  used  more  in  evaluating  white  employees.  Therefore  some  sort 
of  personality  inventory  might  be  more  useful  in  predicting  their  per- 
formance ratings.  In  order  to  utilize  the  same  predictor(s)  for  both 
groups,  Job  performance  standards  would  have  to  be  applied  more  uniform- 
ly. 

P laugher,  Campbell  and  Pike  (1969)  present  data  from  a study  de- 
signed to  assess  whether,  (a)  criterion  ratings  are  affected  by  the  race 
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of  the  rater  and  rates,  and  (b ) whether  rater-ratee  racial  group  member- 
ship affects  the  correlation  between  subjective  and  objective  criterion 
measures.  Their  study  was  part  of  a larger  investigation  which  dealt 
directly  with  validation  of  a predictor  against  a criterion  categorised 
on  the  basis  of  the  race  of  rater  and  ratee.  In  the  1969  study  the 
authors  developed  a nine  scale  performance  rating  fora  with  which  to 
assess  the  Job  performance  of  168  black  and  296  white  medical  technicians. 
All  of  the  technicians  were  employees  of  U.  3.  Veterans  Administration 
Hospitals  throughout  the  country.  Each  technician  was  rated  by  at  leext 
two  supervisors.  The  average  rating  was  used  for  analysis.  The  nine 
rating  scales  were  developed  from  interviews  with  encumbents.  In  addi- 
tion to  supervisory  ratings,  a Job  knowledge  test  wag  developed  as  a 
more  objective  measure  of  Job  performance.  White  technicians  received 
slightly  higher  ratings  on  all  of  the  scales  except  flexibility.  White 
technicians  also  received  higher  scores  on  the  Job  knowledge  test.  A 
Wilcoxon  matched  pairs  signed-ranks  test  was  used  to  test  the  difference 
between  the  two  groups  on  all  of  the  ratings  taken  collectively.  The 
difference  was  significant  at  the  .01  level.  A t-test  indicated  that 
the  mean  difference  between  groups  on  the  job  knowledge  test  was  sig- 
nificant at  the  .01  level.  The  researchers  further  sub-grouped  their 
data  by  racial  combination  of  rater  and  ratee.  When  analysed  in  this 
manner,  the  results  showed  that  black  technicians  rated  by  black  super- 
visors received  higher  ratings  than  blacks  rated  by  whites.  This  was 
true  for  all  of  the  scales  and  the  Job  knowledge  test.  There  are  sever- 
al hypotheses  as  to  why  thi3  might  have  occurred. 

One  hypothesis  is  that  black  supervisors  were  more  lenient  toward 
technicians  of  their  own  race.  This  hypothesis  seems  tenable  even 
though  ratings  of  technicians  rated  by  black  supervisors  were  highly 
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correlated  with  scores  on  the  job  knowledge  test.  A second  hypothesis 
is  that  black  supervisors  were  generally  more  lenient  than  white  super- 
visors. This  is  definitely  a possibility.  There  is  little  certainty 
based  upon  the  data  that  either  group  actually  performed  better  than 
the  other.  One  way  to  solve  this  problem  would  be  to  control  the  char- 
acteristics of  the  ratees  in  such  a manner  that  each  rater  receives  the 
same  information  about  the  person  to  be  rated.  Still  a third  hypothesis 
is  that  the  two  groups  of  supervisors  placed  differential  weight  on 
various  criterion  components  in  rating  black  and  white  subordinates. 

This  hypothesis  seems  tenable  considering  the  correlations  between  per- 
formance ratings  and  scores  on  the  Job  knowledge  test.  The  correlations 
were  higher  for  black  technicians  rated  by  black  supervisors  than  for 
white  technicians  rated  by  black  supervisors.  It  appears  that  black 
supervisors  rated  black  employees  on  the  basis  of  criterion  elements 
similar  to  those  measured  by  the  Job  knowledge  test.  They  apparently 
did  not  do  30  in  rating  white  employees.  A fourth  hypothesis  is  that 
white  supervisors  were  more  objective  when  rating  black  tellers.  This 
hypothesis  also  seems  tenable  given  that  the  correlations  of  performance 
ratings  with  test  scores  were  higher  for  blacks  rated  by  whites  than 
for  whites  rated  by  whites.  This  pattern  did  not  exist  for  black  raters. 
Their  ratings  of  white  subordinates  appeared  to  be  less  objective  than 
their  ratings  of  black  subordinates. 

AH  of  the  hypotheses  above  could  explain  the  results  of  the  Flaugher 
study.  The  major  obstacle  to  eliminating  these  hypotheses  is  control  of 
the  quality  of  ratee  performance.  If  this  were  accomplished,  more  valid 


estimates  of  rater  bias  would  result. 


10 

Ethnic  Group  Membership  Aa  It  Affects 
Mon-Supervisory  Performance  Ratings 

Obviously  there  are  measures  other  than  supervisory  judgements 
which  may  be  used  to  differentiate  acceptable  from  unacceptable  Job 
performance.  One  such  measure  is  the  peer  rating.  Although  they  are 
not  used  as  often  as  supervisory  ratings,  some  research  has  been  done 
regarding  the  effects  of  ethnic  group  membership  on  peer  ratings.  This 
research  provides  a glimpse  of  the  extent  to  which  race  of  the  rater 
and  ratee  Influence  performance  measures  other  than  supervisory  ratings. 

Cox  and  Kromboltz  (1959)  collected  leadership  peer  ratings  from 
63  black  and  U69  white  basic  airmen  at  an  air  force  base  in  Texas.  Peer 
ratings  were  collected  as  the  men  began  their  third  week  of  basic  train- 
ing. The  men  were  assigned  to  various  flights  in  the  order  in  which 
they  arrived  for  processing.  Each  flight  consisted  of  about  60  airmen. 
The  airmen  rated  each  other  on  the  basis  of  the  following  instructions: 

Several  airmen  from  your  flight  will  be  selected  for  positions 
such  as  squad  leader,  right  guide,  assistant  flight  leader 
and  so  forth.  You  are  asked  to  rate  each  member  of  your 
flight  as  to  how  well  you  think  he  can  do  such  a Job.  The 
selection  of  airmen  for  positions  will  be  made  on  the  basis 
of  your  ratings. 

The  ratings  were  made  according  to  the  following  scale: 

5 - Very  well  qualified  for  the  position,  one  of  the  best  in  the 
flight 

k • A well  qualified  man  for  the  position 
3 - Would  not  do  a bad  Job  or  a good  one;  about  average 
2 - Mot  well  qualified  for  the  Job;  would  do  a poor  Job  at  this 
time 

1 - Poorly  qualified  for  the  Job,  would  do  a very  poor  Job  at 
this  time 

Ratings  were  subgrouped  on  the  basis  of  the  race  of  rater.  Each 
man  received  two  ratings.  One  was  determined  by  averaging  the  ratings 
given  by  the  blacks  in  hi3  flight.  The  other  rating  was  determined  by 
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averaging  the  ratings  given  by  the  white  members  of  his  flight.  An 
analysis  of  variance  was  carried  out  on  ratings  given  to  blacks  and  to 
whites.  The  results  showed  that  there  was  an  interaction  between  race 
and  flight.  That  is,  blacks  tended  to  give  higher  ratings  to  blacks, 
but  this  was  not  true  for  all  flights.  By  the  same  token,  whites  gave 
higher  ratings  to  whites  but  this  was  not  true  for  every  flight.  Cox 
and  Srumboltz'  data  show  that  racial  bias  in  criterion  ratings  may  exist 
even  when  peer  ratings  are  used.  However,  the  small  number  of  airmen 
participating  in  the  study  creates  the  possibility  that  friendships 
between  flight  crew  members  not  racial  bias  accounted  for  the  findings. 

Schmidt  and  Johnson  (1973)  studied  the  effects  of  race  on  peer 
ratings  in  an  industrial  setting.  Their  results  were  supportive  of  the 
data  from  the  Cox  and  Srumboltz  study,  but  their  methodology  was  some- 
what different.  Instead  of  merely  assessing  the  degree  to  which  the 
samples  differed  in  their  ratings  of  each  other,  Schmidt  and  Johnson 
conducted  sensitivity  training  specifically  designed  to  eliminate  racial 
bias.  Although  there  was  still  a tendency  for  raters  to  give  higher 
ratings  to  persons  of  their  own  race,  the  differences  were  not  signific- 
ant. 

There  have  been  few  studies  reported  in  the  literature  assessing 
the  relationship  between  race  and  peer  ratings.  The  evidence  indicates 
that  higher  ratings  are  given  when  rater  and  rates  belong  to  the  same 
race.  More  research  must  be  done  which  eliminates  rival  hypotheses 
before  stronger  conclusions  can  be  drawn. 

Criterion  Bias  In  Test  Validation 

So  far  this  discussion  ha3  dealt  exclusively  with  racial  group 
membership  as  a moderator  of  the  relationship  between  race  and  criterion 
ratings.  What  effect  does  rster-ratee  racial  group  membership  have  on 
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the  correlation  between  a given  predictor  or  predictors  and  criterion 
measures?  Evidence  bearing  upon  this  question  Is  scant.  Campbell  et  al. 

(1970)  conducted  a concurrent  validation  study  dealing  directly  with 
this  Issue.  The  purpose  of  the  study  was  to  validate  a test  against 
different  criterion  measures  subgrouped  according  to  various  racial 
combinations  of  rater  and  rates . The  rater-ratee  combinations  were, 

(a)  blacks  rated  by  blacks;  (b)  blacks  rated  by  whites;  (c)  whites  rated 
by  whites;  and  (d)  whites  rated  by  blacks.  Correlations  between  super- 
visory ratings  and  each  of  nine  aptitude  tests  were  computed  separately 
far  each  rater -rat ee  combination.  The  subjects  were  198  black  and  217 
white  medical  technicians.  The  results  showed  that  race  of  the  rater 
and  ratee  may  have  affected  the  validity  coefficients  obtained.  Only 
one  of  the  predictors  (arithmetic)  showed  a consistent  correlation  with 
supervisory  ratings  across  all  of  the  rater-ratee  combinations. 

An  Experimental  Study 

The  investigations  reviewed  thus  far  have  dealt  with  both  super- 
visory ratings  and  peer  ratings.  None  of  them  however  were  conducted  in 
a laboratory  setting  where  ratee  characteristics  and  rater  observations 
could  be  controlled.  This  type  of  study  would  allow  the  researcher  to 
assess  racial  bias  which  i3  experimentally  induced.  Hamner  et  al.  (197*0 
conducted  such  a study.  The  purpose  of  their  investigation  was  to  assess 
whether  performance  ratings  were  free  of  rater  bias  when  the  quality  of 
ratee  performance  was  systematically  controlled.  Their  subjects  were 
thirty-3ix  undergraduate  business  students.  The  subjects  included  nine 
black  males,  nine  black  females,  nine  white  males,  and  nine  white  females. 

They  were  asked  to  play  the  role  of  a grocery  store  manager.  In  this 
capacity  they  were  to  screen  for  potential  employment  eight  ratees  who 
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had  supposedly  applied  for  the  Job  of  stock  clerk.  The  eight  hypothet- 
ical applicants  consisted  of  two  white  males,  two  white  females,  two 
black  males  and  two  black  females.  The  subjects  were  told  that  each 
hypothetical  applicant  had  taken  a three  minute  work  sample  examination 
as  part  of  the  screening  process.  The  examination  consisted  of  remov- 
ing number  ten  cans  from  an  opened  case  and  placing  them  on  grocery 

t 

store  shelves.  The  researchers  had  determined  from  personal  experience 
that  the  average  worker  could  stack  36  cans  In  a three  minute  period;  a 
very  good  (high  performance  worker)  could  stack  k8  cans;  and  a low  per- 
formance worker  could  stack  only  2k  cans  within  the  same  three  minute 
period.  The  eight  Job  applicants  or  ratees  were  trained  so  that  they 
all  stacked  the  cans  the  same  way.  All  of  the  ratees  wore  a light 
shirt,  dark  slacks  and  dark  grocer's  apron.  A panel  of  graduate  stud- 
ents agreed  that  the  ratees  looked  similar  in  dres3,  method  of  task  per- 
formance (i.e.  use  of  both  hands),  quality  of  work  and  facial  expression. 
The  university  media  center  designed  a grocery  store  set  and  filmed  the 
performance  of  the  eight  ratees.  The  film  was  shown  to  the  36  subjects 
in  the  following  order: 

1st  - low  performing  black  male 
2nd  - high  performing  white  female 
3rd  - low  performing  black  female 
kth  - high  performing  white  male 
5th  - low  performing  white  male 
6th  - high  performing  black  female 
7th  - high  performing  black  male 
8th  - low  performing  white  female 

Overall  performance  ratings  of  the  eight  ratees  were  obtained  after  the 
subjects  had  viewed  the  2k  minute  film.  The  film  was  stopped  after  each 
three  minute  segment  to  allow  the  subjects  time  to  write  down  comments 
about  each  applicant's  performance.  Applicants  were  rated  on  a 15  point 
scale  with  the  following  scale  points: 
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(a)  weak  In  overall  performance,  failed  to  produce  acceptable 
amount  of  production 

(b ) performance  periodically  fell  belcw  that  which  I would  expect 
from  similar  Individuals  In  this  task 

(c)  did  a satisfactory  Job,  was  an  average  worker  in  his/her 
performance 

(d)  was  a good  worker,  his/her  performance  was  better  than  average 

(e)  an  exceptionally  good  worker  who  will  be  equalled  by  few  others 

There  were  significant  differences  In  the  ratings  based  upon  rater-ratee 
race  and  sex.  A2X2X2X2X2  design  was  used  to  test  the  effects 
of  (a)  rater's  race  (black/white),  (b)  rater's  sex,  (c)  ratee's  race 
(black/white),  (d)  ratee's  sex  and  (e)  ratee's  performance  level.  An 
F-test  for  the  main  effect  of  level  of  performance  was  significant. 

High  performance  ratees  were  rated  significantly  higher  than  low  per- 
formance ratees  (F  » 70.4-5,  df  » 1/32,  p < .001). 

There  was  also  a significant  main  effect  for  3 ex  of  ratee.  Female 
applicants  were  rated  significantly  higher  than  males  (F  = 16.79,  df  ■ 
l/32,  p < .001).  This  was  especially  true  for  good  applicants.  It  was 
also  found  that  black  raters  gave  black  applicants  significantly  higher 
ratings  than  white  applicants  (a  mean  of  8.50  to  7.72).  White  raters 
gave  higher  ratings  to  whites  them  to  blacks  (a  mean  of  8.27  to  7-30). 

An  F-test  for  the  main  effect  of  rater-ratee  race  and  sex  was  signific- 
ant (F  =»  4. 71,  df  - 1/32,  p < .05).  An  interaction  wa3  found  between 
race  and  level  of  performance.  On  overall  task  performance,  low  per- 
forming blacks  and  high  performing  blacks  were  rated  similarly,  7*31 
and  8.98  respectively.  This  difference  was  not  significant.  Low  per- 
forming whites  and  high  performing  whites  received  quite  dissimilar 
ratings,  10. 36  and  6.63  respectively.  Therefore,  high  performing  whites 
were  favored  over  high  performing  blacks.  Omega  square  values  indicated 
that  although  30$  of  the  variance  in  overall  ratings  was  accounted  for 
by  performance  level,  an  additional  23$  was  accounted  for  by  different 


race-sex  combinations. 
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The  data  from  ffaaner  et  al.  appears  to  support  the  hypothesis  that 
race- sex  bias  in  performance  ratings  does  occur.  It  would  appear  that 
raters  tend  to  give  higher  ratings  to  individuals  of  their  own  race  or 
sex.  There  are  however  some  serious  problems  with  the  procedure  employ- 
ed by  Hamer  et  al.  which  leave  this  hypothesis  open  to  question.  First , 
the  instructional  set  resembled  a selection  procedure  more  than  a per- 
formance evaluation  procedure.  Subjects  rated  the  performance  of  fic- 
titious Job  applicants  not  encumbents.  The  subjects  were  in  essence 
screening  applicants  for  hire  using  the  work  film  as  a predictor,  as 
opposed  to  Judging  actual  Job  performance.  The  study  may  have  demon- 
strated bias  in  hiring  preferences,  but  not  bias  in  criterion  measures. 
Secondly,  differences  in  the  ratings  may  have  been  due  to  contrast 
effects  brought  about  by  the  order  of  presentation  of  the  applicants. 

Wexley  et  al.  (1972)  found  that  students'  ratings  of  Job  applicants 
(presented  on  video  tape)  were  influenced  by  the  ratings  of  preceding 
applicants.  They  found  that  when  an  applicant  with  high  qualifications 
was  preceded  by  applicants  of  high  quality,  he  was  rated  lower  than  a 
similar  applicant  who  was  preceded  by  individuals  of  low  qaulity.  In 
the  Hamner  et  al.  study,  high  and  low  performing  black  males  may  have 
been  rated  similarly  because  high  performing  black  males  were  preceded 
by  high  performing  black  females. 

Aside  from  various  methodological  considerations,  the  Hamner  study 
suffers  from  another  serious  short -coming . The  authors  made  no  attempt 
to  measure  the  processes  underlying  variance  in  the  dependent  variable. 

Ifo  attempt  was  made  to  measure  the  psychological  processes  (i.e.  prejudice, 
ethno centrism,  authoritarianism)  which  may  have  accounted  for  the  dif- 
ferential evaluation  of  applicants  on  the  basi3  of  race.  This  criticism 
also  applies  to  other  reserach  in  this  area. 
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The  research  most  relevant  to  the  psychological  processes  under- 
lying racial  bias  in  criterion  ratings  comes  from  social  psychological 
literature.  Racial  bias  in  performance  evaluation  seems  to  be  similar 
to  racial  prejudice  as  defined  by  social  psychologists.  Rokeach,  Smith 
and  Evans  (i960),  propose  that  racial  prejudice  is  in  large  part  the 
result  of  perceived  dissimilarity  of  beliefs.  They  hypothesize  that  a 
person  does  not  reject  another  on  the  basis  of  that  persons  race  alone, 
but  rather  rejects  others  who  are  perceived  as  having  beliefs  and  values 
different  from  his  own.  Prejudice  against  blacks  for  example,  occurs 
because  whites  think  blacks  differ  from  them  in  important  beliefs  and 
values. 

Research  has  been  done  to  test  Rokeach's  theory  (Byrne  Sc  Wong, 

1962;  Hendrick,  Bixentine  & Hawkins,  1971;  Insko  Sc  Robinson,  I$67; 

Johnson  & Johnson,  1972;  Rokeach  & Mezei,  1966;  Senn,  1971;  Stein,  1966; 
Stein,  Hardick,  & Smith,  1965;  Triandis,  1961;  Triandis  Sc  Davis,  1965)* 
The  usual  research  procedure  has  been  to  measure  subjects'  interpersonal 
preference  for  others  who  supposedly  differed  from  them  racially,  had 
different  beliefs  or  both.  Interpersonal  preference  was  measured 
through  the  use  of  social  distance  scales  ranging  from  "willingness  to 
work  with  the  stimulus  person  in  an  experiment"  to  "willingness  to  share 
an  apartment  with  that  person".  The  research  has  generally  been  suppor- 
tive of  the  theory.  When  information  about  the  stimulus  persons^  beliefs 
was  made  available,  this  information,  not  race  was  the  primary  determin- 
ant of  attitudes  toward  the  stimulus  person.  Research  has  also  shown 
that  race  is  the  primary  factor  In  determining  attitudes  toward  a per- 
son where  information  about  the  person ’3  beliefs  i3  not  available.  As 
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an  example.  If  a at  lamina  person  is  black,  white  subjects  assume  that  the 
person's  beliefs  are  different  from  theirs.  If  indeed  there  is  a tend- 
ency for  supervisors  to  rate  subordinates  of  the  same  race  more  favor- 
ably, the  reason  might  be  perceived  similarities  in  beliefs.  The  ques- 
tion is  why  should  knowledge  of  an  individuals  beliefs  affect  evalua- 
tions of  his  or  her  work  performance. 


There  are  several  theories  within  the  area  of  social  psychology 
which  appeal  to  central  factors  3uch  as  attitudes,  expectations,  know- 
ledges and  beliefs  as  basic  elements  in  the  explanation  of  behavior 
(Shaw  & Costanza,  1970).  Among  these  are  the  cognitive  consistency 
theories.  The  basic  premise  of  these  theories  is  that  inconsistent  cog- 
nitions lead  to  behaviors  directed  toward  the  achievement  of  cognitive 
consistency.  One  of  the  most  prominent  consistency  theories  is  Heider's 
balance  theory  (Heider,  1968). 

Heider  proposes  that  within  the  context  of  interpersonal  relations, 
entities  are  seen  as  forming  unit 3.  The  relationships  which  exist  be- 
tween entities  make  a unit  balanced  or  unbalanced.  Balance  theory  is 
concerned  with  the  sentiments  (affective  reactions)  of  a person  "p" 
toward  a person  "o",  and  an  impersonal  object  "x".  According  to  the 
theory,  separate  entities  such  as  "p",  "o"  and  "x"  constitute  a unit 
when  they  are  3een  as  belonging  together.  As  an  example,  if  person 
"p"  likes  both  person  "o"  and  an  entity  "x",  and  "o"  produced  "x",  then 
the  triad  is  said  to  be  balanced.  On  the  other  hand,  if  "p"  dislikes 
"o"  but  likes  "x",  and  "o"  produced  "x",  then  the  triad  i3  unbalanced. 
Heider  proposes  that  individuals  tend  to  prefer  balanced  over  unbalanced 


situations  because  unbalanced,  situations  create  tension.  Given  a par- 
ticular triad  p-o-x,  the  tendency  toward  balance  induces  relations  that 
result  in  a balanced  state.  If  for  example  there  exists  a situation 
where  "p"  dislikes  "o"  but  likes  "x",  and  "o"  produced  "x",  then  there 
is  a tendency  for  "p"  to  either  change  his  sentiment  toward  "o"  or 
change  the  positive  relation  of  "o"  produced  "x".  If  a situation  exists 
where  a supervisor  "p"  dislikes  a subordinate  "o"  but  values  good  Job 
performance  "x",  while  at  the  same  time  perceiving  that  "o"  produced 
"x",  the  triad  would  be  unbalanced.  One  way  to  reduce  the  imbalance 
would  be  to  deny  the  "o"  (subordinate)  produced  "x"  (good  Job  perform- 
ance) relationship.  This  example  describes  a relationship  between  a 
white  supervisor  and  a black  subordinate.  The  same  propositions  would 
apply  if  the  roles  were  reversed. 

Research  on  balance  theory  indicates  that  structures  consisting  of 
balanced  relations  are  perceived  as  more  pleasant  than  structures  con- 
sisting of  unbalanced  relations  (Morrisette,  1958;  Price,  Hamberg  & 
Ifewcomb,  1966).  However  there  does  not  appear  to  be  any  research  on 
the  application  of  balance  theory  to  performance  evaluation.  In  the 
present  research  an  attempt  is  made  to  relate  balance  theory  to  rating 
behavior. 


Purpose  of  the  Present  Investigation 

The  purpose  of  the  present  investigation  was  to  assess  the  effects 
of  race  of  rater  and  ratee  on  f avorabllity  of  performance  ratings  when 
the  quality  of  ratee  performance  was  systematically  controlled.  The 
question  wa3  whether  raters  would  be  more  lenient  when  rating  persons  of 
their  own  race.  In  previous  investigations  relevant  to  this  question 
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(Baas  & Turner,  19T3;  Hamner  et  al.,  197k),  the  quality  of  ratee  per- 
formance was  not  adequately  controlled.  Thi3  investigation  was  also 
aimed  at  identifying  those  processes  which  may  underly  racial  bias  in 


criterion  ratings. 


CHAPTER  II 


METHOD 

Subjects 

The  subjects  for  this  investigation  were  303  undergraduate  students 
from  the  University  of  Maryland,  College  Park  (9^  white  males,  79  black 
males,  80  white  females,  and  50  black  females).  The  college  experience 
of  the  students  i3  shown  in  Table  1.  This  data  could  only  be  obtained 
on  89^  of  the  subjects.  Comparisons  of  the  ratings  provided  by  the  four 
groups  indicated  no  significant  differences.  All  of  the  subjects  were 

volunteers. 


Procedure 

All  of  the  data  were  collected  prior  to  or  immediately  following 
the  students'  regularly  scheduled  classes.  The  students  were  enrolled 
in  undergraduate  psychology,  sociology,  and  Afro  —American  stud-.es 
courses.  The  data  were  collected  by  a black  researcher.  The  primary 
task  of  the  participants  was  to  review  and  evaluate  the  teaching  per- 
formance of  eight  hypothetical  instructors.  Information  about  each 
instructor  was  presented  in  an  8 I 11  inch  manl  1 la  envelope.  Each 
envelope  contained  a page  of  narrative  information  along  with  a 5 X 7 
inch  photograph. 

All  of  the  hypothetical  instructors  were  males  between  the  ages 
of  twenty-two  and  thirty.  Only  male  instructors  were  used  in  order  to 
narrow  the  focus  of  the  research  and  facilitate  clearer  interpretation 
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of  the  results.  Five  of  the  eight  instructors  were  white  and  three  were 
black.  This  was  done  in  order  not  to  give  away  the  main  thrust  of  the 
study,  which  was  to  compare  evaluations  of  black  and  white  Instructors. 
An  equal  number  of  black  and  white  Instructors  would  have  appeared 
highly  suspicious  since  such  proportions  rarely  exist  in  reality. 

The  narrative  information  about  each  instructor  consisted  of  a list 
of  ten  statements  describing  the  past  teaching  performance.  The  list 
of  statements  was  either: 

1.  Highly  favorable,  with  the  combined  list  having  a mean  value 
of  4.4  on  a five  point  scale; 

2.  Highly  unfavorable,  with  the  combined  list  having  a mean  value 
of  1.4  on  a five  point  scale,  or 

3.  .Neither  highly  favorable  nor  highly  unfavorable,  with  the 
combined  list  having  a mean  value  of  3*0  on  a five  point  scale. 

Scale  values  for  the  statements  were  computed  on  the  basis  of 
ratings  obtained  from  a sample  of  42  undergraduate  students.  These 
students  filled  out  the  questionnaire  shown  In  Appendix  1.  The  sample 
consisted  of  22  white  and  20  black  students.  In  order  to  assess  the 
consistency  of  the  ratings,  responses  from  one  group  were  correlated 
with  responses  from  the  other.  The  average  rating  assigned  to  each 
statement  was  used  for  this  comparison.  The  correlation  obtained  was 
•97.  As  an  additional  check,  the  two  groups  were  divided  and  ratings 
from  ten  white  students  were  combined  with  the  ratings  from  eleven 
black  students.  A correlation  of  .98  was  obtained.  These  correlations 
indicate  that  ratings  of  item  favorability  were  very  reliable.  Utiliz- 
ing the  scaled  statements,  three  levels  of  teaching  performance,  good, 
average,  and  poor  were  described.  There  were  two  white  instructors 
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and  one  black  Instructor  in  both  the  good  and  poor  performance  categories. 
The  average  performance  category  consisted  of  one  white  and  one  black  in- 
structor (Appendix  2). 

Is  addition  to  manipulating  the  race  and  performance  level  of  the 
instructors,  the  order  of  presentation  of  information  was  also  varied. 

The  design  is  shown  in  Table  2.  Table  2 shows  that  approximately  half 
of  the  subjects  were  presented  a photograph  followed  by  the  list  of 
statements.  The  remainder  of  the  students  were  presented  the  list  of 
statements  followed  by  the  photograph.  The  students  were  randomly 
assigned  to  each  of  these  groups.  They  were  required  to  evaluate  the 
performance  of  each  instructor  Immediately  after  reviewing  al  1 the  in- 
formation presented  about  him.  The  students  were  given  a special  rating 
farm  for  rating  the  instructors  (See  Appendix  3)*  2ach  rating  form 
was  filled  out  immediately  after  the  student  had  reviewed  information 
about  an  instructor.  As  shown  in  Appendix  3 , the  students  were  asked 
to  respond  to  tfce  following  questions  concerning  each  instructor. 

1.  Based  on  the  information  provided,  how  well  do  you  think 
this  instructor  performs  his  Job  7 

12  3 ^ 5 

very  poorly  satisfactorily  well  very 

poorly  well 

2.  On  the  basis  of  the  information  provided,  would  you  recom- 
mend thi3  instructor  for  promotion? 

12  3^5 

definitely  probably  uncertain  probably  definitely 

not  not 

In  addition  to  evaluating  the  teaching  performance  of  the  instruc- 
tors, perceived  belief  similarities  between  the  students  and  the  in- 
structors were  also  measured.  Students  were  asked  the  following  ques- 


tions regarding  the  instructors: 
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Table  2 

Order  of  Presentation  of  Information 


s 

Race 

Order 

84 

White 

Ph  N 

90 

White 

I?  Ph 

67 

Black 

Ph  N 

62 

Black 

N Ph 

Ph  - Photograph 
3 » Narrative  Information 
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3.  If  this  instructor  were  to  explain  to  you  his  philosophy 
about  teaching,  how  likely  do  you  think  it  is  that  you 
would  agree  with  hin? 


1 

2 

3 

4 

5 

would 

would 

would 

would 

would 

definitely 

probably 

neither 

probably 

definitely 

disagree 

disagree 

agree  nor 

agree 

agree 

disagree 


4.  Based  upon  the  information  provided  about  this  person,  how 
likely  do  you  think  it  is  that  you  would  agree  with  his 
beliefs  about  students? 


1 

2 

3 

4 

5 

would 

would 

would 

would 

would 

definitely 

probably 

neither 

probably 

definitely 

disagree 

disagree 

agree  nor 

agree 

agree 

disagree 


The  questions  above  were  designed  to  determine  the  extent  to  which  the 


students  assumed  that  their  beliefs  were  different  from  those  of  instruc- 
tors of  a different  race.  The  students  were  also  asked  the  following 


question  about  each  instructor. 

5.  If  this  instructor's  performance  changes  in  the  future,  in 
which  direction  do  you  think  the  change  will  occur? 


1 2 
his  his 

performance  performance 
will  decline  will  decline 
significantly  a little 


3 

I do  not 
think  it 
will  change 


4 

his 

performance 

will 

improve  a 
little 


5 

his 

performance 

will 

improve 

significantly 


la  order  to  elicit  the  full  cooperation  of  the  students,  their 
classroom  instructor  was  asked  to: 


(a)  inform  students  about  the  experiment  at  least  one  class 
period  ahead  of  time 

(b)  emphasise  that  participation  was  voluntary 

(c)  inform  the  students  that  experimental  sessions  would  last 
approximately  45  minutes 


1 
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Method  of  Data  Collection 


The  researcher  was  introduced  to  the  class  at  the  outset  of  each 

experimental  session.  The  researcher  then  said: 

A graduate  student  here  at  the  university  is  conducting  a 
study  of  the  processes  involved  in  teacher  evaluations.  As 
part  of  this  research,  we  have  been  going  out  to  students  and 
having  then  evaluate  the  teaching  performance  of  several 
fictitious  faculty  menfcers.  What  ire  would  like  to  do  today 
is  present  to  you  brief  descriptions  of  eight  persons  all 
of  whom  are  fictitious.  You  are  to  assume  however  that  all 
eight  persons  are  actual  faculty  members  at  a university. 

On  the  basis  of  brief  descriptions  of  these  persons,  which 
will  be  provided  to  you,  we  would  like  for  you  to  evaluate 
their  teaching  performance.  We  will  pass  out  to  each  of 
you,  eight  envelopes.  Each  envelope  will  contain  two  pages 
of  information  and  a rating  form.  One  of  the  pages  of  infor- 
mation will  consist  of  a list  of  short  statements  describ- 
ing the  manner  in  which  the  person  typically  conducts  him- 
self. The  other  page  consists  of  a photograph  of  the  person. 
When  the  envelopes  are  given  to  you,  we  would  like  for  you  to 
open  them  one  at  a time.  Once  yon  have  opened  an  envelope  you 
should  read  the  Information  contained  inside,  and  assuming  the 
person  were  a faculty  member,  use  the  rating  form  to  evaluate 
his  teaching  performance.  Do  you  have  any  questions  at  this 
point? 

We  will  pass  out  the  envelopes  now.  Please  do  not  open 
any  of  the  envelopes  until  you  are  told  to  do  so.  When  you 
are  told  to  begin,  you  should  (a)  Open  one  of  the  envelopes 
and  remove  the  information  inside:  (b)  Read  the  Information 
about  the  person  to  be  evaluated,  (c)  Once  you  have  read  the 
information  evaluate  the  person  utilizing  the  rating  form 
provided.  Remember  that  you  should  read  the  information 
about  the  instructor  before  attempting  to  evaluate  him.  It 
i3  important  that  you  do  not  go  back  over  any  of  the  infor- 
mation on  a page  once  you  have  read  it.  We  would  like  to 
get  your  initial  reaction  to  the  information.  Once  you  have 
filled  out  the  first  rating  form,  put  the  rating  form  and 
the  descriptive  information  back  into  the  envelope.  Set 
that  envelope  aside  and  take  another.  Repeat  steps  a 
through  c until  you  have  evaluated  all  eight  persons.  When 
you  have  finished,  we  will  pass  out  a short  questionnaire 
for  you  to  fill  out.  Do  you  have  any  questions?  You  may 
open  the  first  envelope  and  begin. 


Debriefing; 


After  all  of  the  hypothetical  instructors  had  been  evaluated,  each 
of  the  students  was  asked  to  fill  out  the  questionnaire  shown  in  Appen- 
dix 4.  The  purpose  of  the  questionnaire  was  tc  determine: 
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1.  whether  manipulation  of  the  race  of  the  hypothetical  instruc- 
tors was  effective 

2.  what  the  students  thought  about  as  they  rated  each  instructor 
After  the  questionnaires  had  been  completed  and  all  of  the  materials 
collected,  the  students  were  told  what  the  research  was  about.  They 
were  told  that  the  study  was  designed  to  assess  the  degree  to  which 
they  could  evaluate  each  instructor's  performance  solely  on  the  basis 
of  the  narrative  descriptions  without  regard  to  the  photographs.  The 
researcher  then  answered  all  questions  concerning  the  study. 

Research  Hypotheses 

Based  upon  the  research  of  ? laugher  et  al.  (1972)  and  Hamner  et  al. 
( 197*0  > the  following  hypotheses  were  made. 

Hypothesis  I.  Independent  of  performance  there  will  be  a sig- 
nificant correlation  between  the  racial  similarity  of 
rater  aid  ratee  and  evaluations  of  performance.  Ratings 
of  black  instructors  will  be  higher  when  rated  by  blacks 
aid  ratings  of  white  instructors  will  be  higher  when 
rated  by  whites. 

Hypothesis  II.  There  will  be  a significant  correlation  be- 
tween the  order  of  presentation  of  information  and  ratings 
of  performance.  The  photographs  will  act  as  negative  in- 
formation where  the  race  of  rater  and  ratee  are  not  the  3ame. 

The  following  hypothesis  follows  from  the  research  of  Byrne  and 

Wong  (1962),  Bixentine  and  Hawkins  (1971),  Insko  and  Robinson  (lt-67), 

Rokeach  aid  Miezer  (1967),  Rokeach,  Smith  and  Evans  (i960),  and  Sean 

\ 

(1971). 
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Hypothesis  IU.  Independent  of  performance,  there  will  "be  a 
significant  correlation  between  the  racial  combination  of 
rater  and  ratee  and  Judgements  of  belief  similarity.  The 
subjects  will  perceive  greater  belief  similarity  with 
hypothetical  instructors  of  the  same  race  than  with  in- 
structors of  a different  race. 

The  following  hypothesis  follows  from  the  research  of  Burastein 
(1967),  Jordan  (1953),  Morrisette  (1958),  and  Price,  Haoberg  and  New- 
comb (1966). 

Hypothesis  17.  Where  the  relationship  between  favorability  of 
information,  race  of  the  subject  and  race  of  the  instructor 
are  balanced,  there  will  be  significantly  less  expected 
future  change  in  teaching  performance.  Balanced  vs.  un- 
balanced structures  will  be  defined  as  follows: 

1.  white  subject  + white  instructor  + favorable  information 

» balanced 

2.  black  subject  + black  instructor  + favorable  information 

a balanced 

3.  white  subject  + black  instructor  + unfavorable  information 

a balanced 

k.  black  subject  + white  instructor  + unfavorable  information 
a balanced 

5.  white  subject  + black  instructor  + favorable  information 

a unbalanced 

6.  black  subject  + white  instructor  + favorable  information 

a unbalanced 

7.  white  subject  + white  instructor  +■  unfavorable  information 


unbalanced 
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8.  black  subject  + black  instructor  + unfavorable  information 


unbalanced 


CHAPTER  IU 
RESOETS 

Primary  Findings  and  Post  Hoc  Analyses 


Manipulation  of  the  Independent  Variables 

This  study  involved  the  manipulation  of  four  independent  variables 
(a)  the  race  of  the  hypothetical  instructor,  (b)  the  level  of  the  in- 
structor's performance,  (c)  the  order  in  which  the  information  was  pre- 
sented and  (d)  race  of  the  rater. 

Race  of  the  hypothetical  instructor.  Among  the  eight  hypothetical 
instructors,  five  were  white  and  three  were  black.  Manipulation  of  race 
was  accomplished  through  the  use  of  photographs.  The  effect  of  the 
manipulation  was  measured  by  the  students'  responses  to  the  following 
question:  "How  many  of  the  instructors  would  you  guess  could  be  clas- 

sified S3  members  of  a minority  group?"  This  question  was  asked  after 
all  of  the  hypothetical  instructors  had  been  evaluated  and  the  material 
describing  them  returned  to  the  original  envelope.  Thirty-seven  per- 
cent of  the  students  remembered  the  exact  number  of  black  instructors. 
Fifty-three  percent  of  the  students  indicated  the  correct  number  plus 
or  minus  one  (either  two  or  four).  The  remainder  of  the  students  in- 
dicated that  they  could  not  remember  the  number  of  black  instructors 
rated. 

The  level  of  the  instructor's  performance.  Table  3 shows  the  mean 
performance  ratings  given  to  instructors  who  were  described  as  good, 

average  or  poor  performers.  The  ratings  were  based  on  a five  point 
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Table  3 

Mean  Performance  Ratings  of  Good.,  Average 
and  Poor  Instructors 


Performance 

Level 

Race  of 
Instructor 

Mean  Performance  Rating 
Quality  of  Performance  Prcmotability 

High* 

White 

4.48 

4.28 

High 

Black 

4.50 

4.36 

Average 

White 

2.85 

2.69 

Average 

Black 

2.93 

2.98 

Low* 

White 

1.47 

1.46 

Low 

Black 

1.38 

1.44 

*The  average  rating  of  the  tvo  white  instructors. 
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scale  (questions  one  and  two.  Appendix  3).  It  can  be  seen  from  Table  3 
that  the  students  clearly  differentiated  between  good,  average,  and 
poor  instructors. 

Order  of  presentation  of  information.  Order  effects  were  hypothe- 
sized on  the  basis  of  the  assumptions  that  (a)  photographs  of  black 
instructors  represented  negative  information  to  white  raters  and  posi- 
tive information  to  black  raters,  and  (b)  photographs  of  white  instruc- 
tors represented  negative  information  to  black  raters  and  positive  in- 
formation to  white  raters.  Combining  the  photographs  with  highly  favor- 
able or  highly  unfavorable  narrative  information  was  expected  to  produce 
positive-positive,  positive- negative,  negative-positive,  or  negative- 
negative order  effects.  Observation  of  the  students  as  they  reviewed 
the  material  revealed  that  manipulation  of  the  order  of  presentation  of 
information  was  not  very  effective.  Many  students  ignored  the  order 
in  which  the  material  was  presented,  and  looked  at  the  photograph  prior 
to  reading  the  narrative  information. 

Race  of  rater.  Subjects  were  classified  as  white  or  black  solely 
on  the  basis  of  their  physical  characteristics.  At  no  time  were  the 
subjects  asked  to  provide  information  concerning  their  race.  However, 
a3  it  turned  out,  the  subjects  were  enrolled  in  classes  that  were  over- 
whelmingly white  or  overwhelmingly  black. 

Assessment  of  the  Interdependence  of  Ratings 

There  were  303  subjects,  each  of  whom  rated  (the  same)  eight  hypo- 
thetical instructors.  Ratings  of  the  two  white  instructors  in  the  high 
performance  category  were  combined  and  the  average  rating  used  for 
analysis.  This  wa3  also  done  for  the  two  white  instructors  in  the  poor 
performance  category.  As  a result,  there  were  six  observations  per 
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subject  instead  of  eight.  In  computing  the  correlations  discussed  in 
this  chapter,  each  rating  was  treated  as  an  independent  measure.  The 
fact  that  the  same  subject  produced  more  than  one  rating  was  ignored. 

As  an  example,  when  racial  similarity  between  rater  and  ratee  was  cor- 
related with  ratings  of  teaching  performance,  (using  the  total  sample) 
an  N of  I8l8  (303  X 6)  was  used.  This  was  done  only  after  determining 
that  there  were  no  significant  differences  between  subjects  with  regard 
to  rating  3tyle.  An  analysis  of  variance  with  repeated  measurements 
was  used  to  determine  whether  there  was  a significant  difference  between 
subjects  with  regard  to  the  type  of  ratings  given.  The  design  is  shown 
in  Figure  1.  The  analysis  was  done  separately  on  the  ratings  of  white 
males,  white  females,  black  males  and  black  females.  Table  4 shows  the  1 

results  of  white  males'  ratings  of  white  instructors.  The  dependent 
variable  was  how  well  the  students  thought  each  instructor  performed 
his  Job. 


5 

□ 

* Level  of  performance  =»  high,  medium,  and  lew 

Figure  1.  Analysis  of  variance  repeated  measurements  design. 
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Table  4 

Summary  Table  of  the  Performance  Ratings  of  White  Instructors 

by  White  Males 


sv 

df 

SS 

MS 

P 

Total 

282 

556 

S 

94 

26.4 

.28 

.82 

A 

2 

465-98 

233 

685* 

S/A 

186 

63.6 

•34 

*p  < .01 
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As  Table  4 shows,  performance  level  accounted  for  all  of  the  sig- 
nificant variance  in  performance  ratings.  There  were  no  significant 
subject  effects.  Table  5 shows  the  results  of  black  males'  ratings  of 
black  instructors  in  terms  of  quality  of  performance. 

As  Table  5 shows,  performance  level  again  accounted  for  all  of  the 
significant  variance  in  performance  ratings.  There  were  no  significant 
subject  effects.  The  results  were  similar  for  white  females  and  black 
females.  Performance  level  accounted  for  all  of  the  significant  var- 
iance in  performance  ratings.  Again  there  were  no  subject  effects. 

All  of  the  results  are  reported  on  the  basis  of  the  race  of  the 
rater.  Hone  of  the  results  are  reported  by  sex  of  rater.  That  is, 
white  raters  include  both  males  and  females.  Black  raters  also  include 
both  males  and  females.  This  was  done  after  comparing  male  and  female 
raters.  Zero  order  correlations  were  computed  (by  sex  of  the  rater) 
between  rater -rstee  racial  similarity  and  (a)  ratings  of  performance 
and,  (b)  ratings  of  belief  similarity.  The  correlations  for  female 
raters  were  compared  with  those  of  male  raters.  Hone  of  the  correla- 
tions was  significantly  different. 

Order  of  Presentation  of  Information  and  Performance  Ratings 

Partial  correlations  were  computed  between  order  of  presentation 
and  performance  ratings,  with  level  of  performance  and  race  of  the  ratee 
held  constant.  The  correlations  were  computed  on  the  basis  of  the  total 
sample  and  separately  for  black  and  white  raters.  Hone  of  the  correla- 
tions was  significant. 

Level  of  Performance  and  Performance  Ratings 

The  correlations  presented  in  Table  6 represent  the  relationship 
between  level  of  performance  (low,  average,  and  high)  and  performance 


*p  < .01 


Table  6 

Zero  Order  Correlations  Between  Level  of  Performance 
of  Performance  Ratings 


Race  of 
Rater 

Quality  of 
Performance 

Promotabllity 

White 

.92** 

.88* 

(1037) 

(1037) 

Black 

.85** 

.82** 

(757) 

(756)- 

Total 

.89** 

.85** 

(179*0 

(1793) 

**p  < .01 

Sote:  Sample  size  shown  In  parentheses 
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ratings.  Level  of  performance  accounted,  for  a large  proportion  of  the 
variance  (67^  - &5l>)  in  both  the  black  and  white  groups.  Since  the 
major  research  hypotheses  of  the  present  study  are  concerned  with  con- 
taminants beyond  these  relevant  aspects  of  performance,  level  of  per- 
formance will  be  partlaled  out  of  the  ratings  before  testing  the  hypo- 
theses. 

The  Relationship  Between  Rater-Ratee  Slwilarity  and  Performance  Ratings 
With  Other  Variables  Held  Constant 

Partial  correlations  were  computed  between  ratings  of  performance 
(questions  one  and  two  Appendix  3)  and  racial  similarity  between  rater 
and  ratee,  with  order  of  presentation  of  information  and  level  of  per- 
formance held  constant.  These  correlations  were  computed  for  both 
white  and  black  students,  and  the  total  sample.  Partial  correlations 
based  upon  the  total  sample  represent  the  degree  to  which  racial  simil- 
arity between  rater  and  rates  was  related  to  performance  ratings  without 
regard  to  the  specific  race  of  rater  or  ratee.  Duany  coding  (0  =»  dis- 
similar, 1 * similar)  was  used  to  designate  rater-ratee  similarity. 

The  dependent  variable  was  based  on  the  five  point  scales  shown  in 
Appendix  3-  A partial  correlation  of  .07  was  obtained  between  rater- 
ratee  similarity  and  ratings  of  bow  well  each  instructor  performed  his 
Job.  This  correlation  is  significant  at  the  .05  level  with  17?0  degrees 
of  freedom.  The  partial  correlation  between  rater /ratee  similarity  and 
ratings  of  promot ability  was  .Oh,  df  - 1789,  p < .05.  The  partial  cor- 
relations broken  down  by  race  of  rater  are  shown  in  Table  7-  These  cor- 
relations actually  represent  the  degree  of  relationship  between  the  in- 
structor's race  and  his  performance  rating,  since  the  race  of  the  rater 
was  held  constant. 


Table  7 

Second  Order  Partial  Correlations  Between  Race  of  Ratee  arxl 
Performance  Ratings  by  Race  of  Rater 


Race  of 
Rater 


Quality  of 
Performance 


Promo  tability 
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White  students  gave  higher  ratings  to  Instructors  of  the  sane  race 
when  evaluating  the  quality  of  the  instructors'  performance.  Black  stud- 
ents gave  higher  ratings  to  Instructors  of  the  sane  race  when  assessing 
the  instructors ' prccmotability. 

Racial  S-twHiarity  of  Rater  and  Ratee  Correlated  with  Ratings  of 
Belief  Similarity.  'v 

Based  upon  the  total  sample,  the  partial  correlation  between  rater- 
rat  ee  racial  similarity  and  agreement  with  the  instructor's  teaching 
philosophy  was  .05  (df  =•  1788,  p < .01)  with  the  order  of  presentation 
of  information  and  level  of  performance  held  constant.  The  partial  cor- 
relation between  racial  similarity  and  agreement  with  the  instructors 
beliefs  about  students  was  . C6  (df  * 1788,  p < .01).  The  same  corre- 
lations broken  down  by  race  of  the  rater  are  shown  in  Table  8. 

Table  8 shows  that  the  race  of  ratee  was  significantly  related,  to 
perceived  agreement  with  the  instructor's  beliefs  in  three  out  of  four 
cases.  In  all  three  cases  there  was  greater  agreement  with  the  beliefs 
of  instructors  of  the  same  race  as  the  rater. 

Expected  Change  In  Future  Performance 

In  addition  to  evaluating  teaching  performance,  the  students  pro- 
vided ratings  of  how  much  they  expected  each  instructor' 3 performance 
to  change.  The  responses  were  analysed  by  computing  the  average  rating 
given  to  each  instructor  by  white  and  black  students.  Comparisons  were 
made  between  balanced  and  unbalanced  situations  as  defined  on  page  28. 
The  results  are  shown  in  Table  9-  There  were  no  significant  differences 
between  the  ratings  based  upon  balanced -unbalanced  categorization. 
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Table  8 _ 

Second  Order  Partial  Correlations  Between  Race  of  the 
Instructor  and  Ratings  of  Belief  Similarity 
by  Race  of  Rater 


Race  of 

Philosophy  About 

Beliefs  About 

S 

Rater 

Teaching 

Students 

White 

.07** 

.05 

103? 

Black 

.08* 

.09** 

755 
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Table  9 


Average  Rating  of  Expected  Change  by  Race  and  Degree 
of  Cognitive  Balance 

Performance 

Level 

Race  of 
Rater 

Race  of 
Ratee 

Cognitive 

Definition 

X 

Rating 

High 

White 

White 

Balanced 

3-7^ 

High 

Black 

White 

Unbalanced 

3-83 

High 

Black 

Black 

Balanced 

4.04 

High 

White 

Black 

Unbalanced 

3.87 

Low 

White 

Black 

Balanced 

2.69 

low 

Black 

Black 

Unbalanced 

2.78 

Low 

Black 

White 

Balanced 

2.70 

Low 

White 

White 

Unbalanced 

2.fi2 

1 


k 
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Post  Hoc  Analyses 

Having  found  that  there  was  a significant  relationship  between 
rater-rat e«  racial  similarity  and  performance  ratings,  further  analyses 
were  conducted  to  determine  the  performance  level  at  vhich  the  relation- 
ship vas  strongest.  Ho  hypotheses  had  been  made  with  respect  to  these 
analyses. 

Partial  correlations  were  computed  between  rater-ratee  racial  simil- 
arity and  performance  ratings  for  each  level  of  teaching  performance, 
with  the  order  of  presentation  of  information  held  constant.  The  results 
are  shown  in  Table  10.  White  raters  gave  higher  ratings  (in  terms  of 
promotability ) to  average  black  instructors  than  to  average  white  instruc- 
tors. They  rated  poor  white  instructors  more  favorably  than  poor  black 
instructors  in  terms  of  quality  of  performance.  Black  raters  evaluated 
average  black  instructors  more  favorably  than  average  vhite  instructors 
both  with  respect  to  quality  of  performance  and  promotability.  Table  11 
shows  the  mean  ratings  for  each  level  of  teaching  performance  by  race 
of  rater  and  rates. 

Table  12  shovs  the  partial  correlation  between  race  of  the  instruc- 
tor and  ratings  of  belief  similarity  for  each  level  of  teaching  perform- 
ance. The  data  indicates  that  racial  similarity  of  rater  and  ratee 
was  related  to  perceived  belief  similarity  only  within  the  average  category 
for  white  raters  and  the  poor  category  for  black  raters.  White  raters 
indicated  greater  agreement  with  the  beliefs  of  a mediocre  white  instruc- 
tor than  a similar  black  instructor.  Black  raters  showed  greater  agree- 
ment with  the  beliefs  of  an  average  black  instructor  than  a similar  white 
instructor.  Table  13  shows  the  mean  ratings  of  belief  similarity  for 
each  level  of  teaching  performance  by  race  of  rater  and  ratee. 
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Table  10 


Partial  Correlations  Between  Instructor’s  Race  and  Performance 
Ratings  by  Level  of  Performance 


Performance  Level 

Type  of 
Rating 

r 

Whites 
(IT  - 348) 

Slacks 
(IT  - 249) 

Quality  of 
Performance 

.01 

.05 

High 

Promotability 

-.03 

.08 

Quality  of 
Performance 

.04 

• 13* 

Average 

Promotability 

-.11* 

.22** 

Quality  of 
Performance 

.17** 

-.02 

Low 

Promotability 

.08 

.03 

*p  < .05 

**p  < .01 


Table  11 
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Table  12 

Partial  Correlations  Between  Race  of  Ratee  and  Ratings  of  Belief 
Similarity  for  Each  Level  of  Teaching  Performance 


Level  of 
Performance 


Type  of  Rating 


Whites 
(JT  » 348) 


Blacks 
(N  - 249) 


Agreement  with  In at rue - 
tors'  Philosophy  about 
Teaching 


-.05 


.09 


High 


Agreement  with  Instruc- 
tors' Philosophy  about 

.04 

.06 

Students 

Agreement  with  Instruc- 
tors’ Philosophy  about 
Teaching 

.01 

.18* 

Average 

Agreement  with  Instruc- 
tors' Philosophy  about 

.03 

.22* 

Student 3 

Agreement  with  Instruc- 
tors’ Philosophy  about 

Teaching 

.12* 

-.05 

Low 

Agreement  with  Instruc- 
tors’ Philosophy  about 

.15* 

-.06 

Students 

1 

*p  < .01 
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Process  Variables 

The  questionnaires  completed  after  all  of  the  instructors  had  been 
rated  were  designed  to  measure  those  process  variables  which  may  have 
influenced  the  students'  ratings.  Questions  1,  7 and.  11,  (Appendix  4), 
were  specifically  designed  for  this  purpose. 

Question  7 was  designed  to  determine  whether  there  were  differ- 
ences between  black  and  white  students  in  terms  of  their  criteria  for 
evaluating  teaching  performance.  The  question  was,  "What  do  you  think 
is  the  most  important  thing  for  an  instructor  to  accomplish  in  teaching 
a course".  Content  analysis  of  the  responses  yielded  the  results  shown 
in  Table  14.  A3  Table  14  shows,  both  groups  felt  that  the  primary  goal 
of  an  instructor  should  be  to  teach  in  a manner  which  helps  students 
understand  the  material.  Black  students  also  felt  that  an  instructor 
should  attempt  to  develop  a good  relationship  with  students,  by  trying 
to  understand  them  and  show  consideration  for  their  needs.  White  students 
felt  it  was  important  for  an  instructor  to  present  the  material  in  an 
interesting  manner  so  that  students  want  to  learn.  In  general,  white 
students  provided  a greater  variety  of  responses  than  black  students. 

Table  15  shows  students'  responses  to  the  question,  "As  you  were 
actually  rating  the  instructors,  what  thoughts  went  through  your  mind". 
Table  15  3hows  that  most  of  the  students  thought  about  the  consequences 
of  being  taught  by  each  instructor  as  they  evaluated  him.  The  primary 
concern  seemed  to  be  the  consequences  each  instructor 1 s performance 
would  have  0n  his  students.  Thoughts  along  this  line  vere  apparently 
crystalined  by  associating  the  individual  being  rated  with  past  instruc- 
tors. A fair  percentage  of  the  students  wondered  about  the  real  pur- 
pose of  the  experiment. 
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What  Students  Thought  Most  Important  for  an  Instructor  to  Accomplish 
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Get  feedback  from  students 


Student  a*  Thoughts  as  They  Evaluated  the  Instructors 


Whether  or  not  the  Instructor  will  change 
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Question  one  was  designed  to  measure  the  students'  "guesses"  as 
to  what  the  research  was  actually  about.  Approximately  19  percent  of 
the  white  students  and  5 percent  of  the  blech  students  thought  the  re- 
search "had  something  to  do  with"  comparing  the  ratings  of  white  and 
blech  instructors. 

Summary  of  Data  Supportive  and  Non-Support ive  of 
the  Research  Hypotheses 

Hypothesis  I.  Independent  of  the  level  of  performance  there  will 
be  a significant  correlation  between  the  racial  similarity  of  rater  and 
ratee  and  evaluations  of  performance . Ratings  of  black  instructors  will 
be  higher  when  rated  by  blacks  and  ratings  of  white  instructors  will  be 
higher  when  rated  by  whites. 

£fypothe3is  I was  supported  by  the  results  from  this  study.  Evalua- 
tions of  teaching  performance  were  related  to  the  racial  similarity  of 
rater  and  ratee.  However,  racial  similarity  of  rater  and  ratee  account- 
ed for  less  than  one  percent  of  the  variance  in  ratings  of  teaching 
quality  or  premat ability. 

Hypothesis  II.  There  will  be  a significant  correlation  between  the 
order  of  presentation  of  information  and  ratings  of  performance.  The 
photographs  will  act  aa  negative  information  where  the  race  of  the  in- 
structor and  the  race  of  the  rater  are  net  the  same.  Hypothesis  II  was 
not  supported  by  the  data  obtained  from  this  study. 

Hypothesis  III.  Independent  of  the  level  of  performance  there  will 
be  a significant  correlation  between  the  racial  similarity  of  rater  and 
ratee  and  Judgments  of  belief  similarity.  Subjects  will  perceive  greater 
belief  similarity  with  hypothetical  instructors  of  the  3ame  race  than 


with  other  instructors. 
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Hypothesis  IU  was  supported  by  the  data  from  tills  research.  The 
degree  to  which  students  thought  their  beliefs  and  those  of  an  instruc- 
tor were  similar  was  related  to  the  instructor's  race.  However,  racial 
similarity  of  rater  and  ratee  accounted  for  less  than  one  percent  of 
the  variance  In  ratings  of  agreement  with  teaching  philosophy  or  beliefs 
about  students. 

Hypothesis  17.  Where  the  relationship  between  fsvorability  of  in- 
formation, race  of  the  subject  and  the  instructor's  race  is  balanced, 
there  will  be  significantly  less  expected  future  change  in  the  instruc- 
tor's teaching  performance.  Manipulation  of  the  independent  variable 
was  not  sufficient  to  test  this  hypothesis.  That  is,  the  subjects  did 
not  seem  to  form  cognitive  triads  on  the  basis  of  each  instructor's  race. 
Responses  to  the  debriefing  questionnaire  indicated  the  consequences 
each  instructor's  teaching  performance  would  have  on  students  was  the 
most  salient  factor  influencing  the  formation  of  cognitions  about  him. 


CHAPTER  IV 


DISCUSSION 

Before  discussing  the  results  of  this  research,  several  methodolog- 
ical issues  should  be  addressed.  First,  the  effects  of  the  race  of  the 
researcher  are  not  known.  Since  only  a black  researcher  was  present  at 
the  time  the  data  was  collected,  experimenter  effects  may  have  been 
operative.  The  subjects'  rating  behavior  may  have  been  different  had 
a team  of  researchers  (one  white  and  one  black)  collected  the  data.  It 
is  also  important  to  note  that  the  levels  of  teaching  performance  were 
for  the  most  part  extreme.  Descriptions  of  good  instructors  left  little 
doubt  as  to  the  quality  of  their  teaching  performance  as  did  descrip- 
tions of  poor  instructors.  Since  such  wide  variations  in  the  perform- 
ance ratings  were  experimentally  induced,  there  may  have  been  little 
opportunity  for  other  effects  to  have  a major  impact.  A third  method- 
ological consideration  is  that  the  subjects  were  not  selected  at  random. 
The  participants  were  volunteers  from  existing  college  classes.  The 
degree  to  which  their  rating  behavior  was  typical  of  college  students 
in  general  is  not  known.  Moreover,  the  generalizability  of  the  ie3uit3 
of  this  research  to  non-educational  organizations  i3  also  unknown.  Taken 
singly,  or  in  total,  the  factors  outlined  above  might  have  influenced 
the  outcomes  of  this  research.  In  pointing  out  these  factors,  the  in- 
tent is  not  to  detract  from  the  validity  of  the  findings,  but  rather 
to  facilitate  appropriate  interpretation  of  the  data. 
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The  results  of  this  study  Indicate  that  teaching  performance  was 
the  most  important  variable  affecting  rating  behavior.  When  the  entire 
range  of  teaching  performance  is  considered,  race  effects  were  small. 

This  could  have  been  expected  to  some  degree,  since  descriptions  of 
teaching  performance  in  the  good  and  poor  categories  were  extreme . 
nevertheless,  on  the  basis  of  the  total  sample,  level  of  performance 
accounted  for  79 of  the  variance  in  ratings  of  teaching  quality  and. 

72^  of  the  variance  in  ratings  of  promotability.  These  findings  are 
corroborated  by  the  fact  that  less  than  50^  of  the  subjects  indicated 
remembering  the  exact  number  of  black  instructors  rated.  The  data  do 
not  however  support  the  conclusion  that  there  were  no  race  effects. 

hypothesis  I was  supported  by  the  results  in  Table  J.  Racial  simil- 
arity of  rater  and  ratee  was  significantly  related  to  performance  ratings 
for  both  white  and  black  subjects  when  the  level  of  teaching  performance 
was  statistically  controlled.  Higher  ratings  were  given  when  the  rater 
and  ratee  belonged  to  the  same  race.  This  finding  is  supported  by  the 
research  of  ? laugher  et  al.  (1969)-  The  race  effects  shown  in  Table  7 
are  so  small  however  that  use  of  controls  such  as  forced  choice  rating 
scales  or  preliminary  training  for  the  raters  might  have  completely 
eliminated  rater  bias  altogether.  The  basic  finding  for  both  white  and 
black  subjects  was  that  when  given  relevant  information,  the  subjects 
could  accurately  evaluate  performance.  The  effects  of  irrelevant  infor- 
mation such  as  the  race  of  the  ratee  were  minimal. 

Hypothesis  II  was  not  sufficiently  tested  by  this  study.  It  i3  not 
surprising  that  order  of  presentation  of  stimulus  information  was  insig- 
nificant, since  observation  of  the  raters  revealed  that  manipulation  of 
this  variable  was  not  effective.  Many  students  ignored  the  order  in 
which  the  material  wa3  presented  and  looked  at  the  photographs  prior  to 
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evaluating  each  instructor.  In  future  research  It  would  be  better  to 
have  the  information  about  each  ratee  presented  on  film  or  slides  thus 
allowing  greater  control  over  order  of  presentation. 

hypothesis  HI  is  supported  by  the  results  in  Table  8.  Ratings 
of  belief  similarity  were  related  to  the  racial  similarity  of  rater 
and  ratee  for  both  black  and  white  subjects  when  level  of  teaching  per- 
formance was  statistically  controlled.  Raters  assumed  that  their 
beliefs  were  more  similar  to  those  of  an  instructor  of  the  same  race 
than  an  instructor  of  a different  race.  However,  ratings  of  belief 
similarity  closely  paralleled  ratings  of  teaching  performance  and  may 
simply  represent  the  halo  effect.  Despite  this  possibility,  ratings 
of  belief  similarity  are  especially  interesting  since  no  information 
was  given  about  the  beliefs  of  any  of  the  instructors.  The  subjects 
apparently  made  assumptions  about  each  instructor's  beliefs  without 
relevant  information.  The  assumptions  were  based  to  some  extent  on  each 
instructor's  race.  This  finding  is  supported  by  the  research  of  Byrne 
and  Wong  (l97l)>  Inako  and  Robinson  (1967)  Rokeach  and  Miezer  (1967), 
Rokeach,  Smith  and  Evans  (l$6o),  and  Senn  (1971)- 

Rypothesi3  IV  was  not  adequately  tested  by  this  research.  The  de- 
gree of  cognitive  balance,  as  defined  in  this  study,  was  not  sufficient- 
ly manipulated  to  determine  the  applicability  of  balance  theory  to  rater 
bias.  This  was  probably  due  to  the  fact  that  in  forming  cognitions 
about  each  instructor,  the  students  gave  more  weight  to  the  consequences 
of  an  instructor' 3 behavior  than  to  his  race.  The  students  apparently 
did  not  experience  positive  or  negative  sentiments  toward  an  instructor 
solely  on  the  basis  of  hi3  race.  This  was  the  assumption  underlying  the 
definitions  of  balance  employed  here.  The  primary  concern  of  the  stud- 


ents seemed  to  be  the  consequences  each  instructor's  behavior  would 
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hare  for  students.  Bloch  raters,  for  eaample,  did  not  necessarily  per- 
ceive as  unbalanced,  situations  where  a black  instructor  was  described 
unfavorably  or  a white  instructor  described  favorably.  In  addition, 
white  raters  did  not  necessarily  perceive  as  unbalanced,  situations 
where  a white  instructor  was  described  unfavorably  or  a black  instructor 
described  favorably. 

Implications  for  Future  Research 

Post  hoc  analysis  of  the  data  yielded  some  useful  information. 

Apparently  there  were  no  significant  race  effects  where  teaching  perform- 
ance was  described  favorably;  but  where  teaching  performance  was  de- 
scribed as  average,  white  raters  gave  higher  ratings  to  black  instruc- 
tors than  to  white  instructors  in  terms  of  promotability . White  raters 
also  gave  poor  white  Instructors  higher  ratings  than  poor  black  instruc- 
tors in  terms  of  quality  of  performance.  Black  raters  gave  higher  ratings 
to  average  black  instructors  than  to  average  white  instructors  both  in 
terms  of  quality  of  performance  and  promotability.  Apparently  white 
raters  were  influenced  by  the  race  of  the  ratee  only  where  teaching  per- 
formance was  not  clearly  outstanding.  The  magnitude  and  direction  of 
the  relationship  seems  to  have  depended  on  a number  of  factors  including 
the  rating  situation  and  the  performance  dimension  to  be  evaluated. 

White  raters  may  have  been  more  willing  to  promote  an  average  black 
instructor  than  an  average  white  instructor  for  any  one  or  all  of  the 
fallowing  reasons:  (a)  This  performance  dimension  was  more  subjective 
than  the  others;  (b)  These  raters  were  more  willing  to  give  a black 
instructor  the  benefit  of  the  doubt;  (c)  Given  equivalent  teaching  abil- 
ity, the  raters  felt  it  more  important  from  an  affirmative  action  stand- 
point to  promote  a black  instructor;  (d)  The  raters  were  over-zealous 
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In  their  attempts  to  be  fair;  (e)  A a long  as  teaching  performance  vas 
not  too  poor,  the  raters  attempted  to  rate  black  Instructors  as  high 
or  higher  than  vhlte  Instructors  In  order  to  demonstrate  that  they 
were  not  prejudiced.  Any  of  these  factors  may  hav's  accounted  for  the 
white  raters’  evaluations  of  average  instructors.  All  of  them  are  in- 
teresting topics  for  future  research. 

The  factors  underlying  white  raters’  evaluations  of  poor  instruc- 
tors are  also  open  to  speculation.  They  may  have  rated  poor  white  in- 
structors more  favorably  than  poor  black  instructors  because  they  were 
less  willing  to  accept  extremely  negative  information  about  individuals 
of  their  own  race.  That  is,  extremely  negative  descriptions  were  prob- 
ably sore  in  line  with  white  raters’  expectations  about  black  instructors. 
The  validity  of  this  hypothesis  can  only  be  determined  through  future 
research. 

Black  raters  apparently  were  not  influenced  by  the  race  of  the  in- 
structor when  rating  very  good  or  very  poor  instructors.  However,  when 
evaluating  average  performance,  black  raters  favored  black  instructors 
over  white  instructors  both  in  terms  of  teaching  performance  and  promot- 
ability.  The  rationale  for  their  behavior  cannot  be  determined  on  the 
basis  of  the  data.  They  may  have  felt  that  an  average  black  instructor 
deserved  higher  ratings  than  an  average  white  instructor  by  virtue  of 
the  obstacles  he  must  face  in  teaching  at  a predominantly  white  univer- 
sity. 

In  summary  then,  the  degree  of  rater  bias  for  both  white  aod  black 
subjects  seems  to  have  been  related  to  the  overall  quality  of  ratee 
performance.  This  finding  is  supported  by  the  research  of  F laugher  et  al. 
(1969),  Hamner  et  al.  (197M,  Bigoness  (1976),  Md  Rotter  and  Rotter 
(reported  in  Bigoness,  1976).  Rotter  and  Rotter  found  that  low  performing 
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blacks  were  rated  significantly  higher  than  low  performing  whites,  while 
high  performing  blacks  and  whites  were  rated  similarly.  Bigoness  (1976) 
found  that  while  low  performing  males  and  females  were  rated  similarly, 
high  performing  females  were  rated  significantly  higher  than  high  per- 
forming males.  Deaux  and  Taynor  (1973)  found  that  highly  competent  males 
were  rated  more  favorably  than  highly  competent  females;  but  males  of  low 
competence  were  rated  lower  than  similar  females. 

Conclusions 

The  results  of  this  study  indicate  that  racial  bias  influenced  per- 
formance ratings,  but  only  to  a limited  degree.  The  subjects  in  this 
study  evaluated  teaching  behavior  primarily  on  the  basis  of  relevant 
information  about  teaching  performance.  Teaching  performance  also  in- 
fluenced ratings  of  perceived  attitude  similarity.  Other  research  has 
shown  that  the  sex  of  the  rater  and  ratee  may  influence  rating  behavior. 
However,  more  research  must  be  done  before  a parsimonious  explanation 
of  the  effects  of  rater-ratee  characteristics  on  performance  evaluation; 
can  be  offered.  It  is  clear  that  in  future  research  greater  efforts 
must  be  made  to  assess  not  only  the  presence  or  absence  of  rater  bias, 
but  also  those  factors  which  may  account  for  it. 


APPENDIX  1 


TEACHER  EVALUATION  QUESTIONNAIRE 
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As  part  of  my  doctoral  dissertation  research,  I am  conducting  a 
study  involving  an  evaluation  of  the  teaching  performance  of  faculty 
members.  As  part  of  this  project  I would  like  to  see  how  good  or  bad 
you  think  some  of  the  things  are  that  instructors  do.  Below  is  a list 
of  statements  describing  the  types  of  things  that  instructors  do.  Read 
each  statement  and  indicate  how  good  or  bad  you  think  the  behavior  de- 
scribed in  the  statement  is  in  terms  of  contributing  to  effective  teach- 
ing performance.  Please  use  the  following  five  point  scale  to  rate 
each  item. 

1 = Very  bad,  something  an  instructor  should  definitely  not  do. 

2 = Poor,  something  an  instructor  should  try  to  avoid  doing. 

3 = Does  not  matter,  something  that  really  does  not  matter  one 

way  or  the  other. 

h = Good,  something  an  instructor  should  try  to  do  whenever 
possible. 

5 =»  Excellent,  something  an  instructor  should  definitely  do. 

1.  students  usually  cannot  see  as  he  writes  on  the  board 

2.  lectures  in  a low  monotone 

3.  is  ill  informed  about  the  topics  in  the  course 

k.  asks  for  more  details  vhen  he  does  not  understand  what  a 

student  says 

5«  invites  students  to  make  suggestions  about  the  course 

6.  introduces  many  ideas  during  each  class 

7*  starts  class  on  time 

8.  gives. concise  explanations 

9.  is  enthusiastic 

10.  invites  students  to  ask  questions 

11.  tells  students  which  material  they  will  not  be  tested  on 

12 . provides  students  with  the  opportunity  to  apply  newly  acquired 

knowledge. 
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13.  synthesizes  or  integrates  the  material  effectively 

gives  students  written  comments  about  their  performance 

15  • gives  tests  which  do  not  give  students  the  opportunity  to 
show  what  they  have  learned 

16.  does  not  give  enough  exams 

17.  expresses  his  appreciation  when  a student  does  a good  Job 

18.  tells  the  students  what  parts  of  the  material  are  the  most 
important 

19.  gives  too  many  exams 

20.  addresses  students  by  name 

_21.  can  tell  if  the  class  i3  understanding  him  or  not 

22.  is  usually  incoherent  or  vague  in  what  he  is  saying 

23 . tells  students  at  the  beginning  of  class  what  he  is  going  to 
tali  about 

2K  explains  how  the  topics  in  the  course  are  related  to  each  other 

25.  makes  class  time  seem  to  pass  quickly 

26.  offers  topics  which  are  usually  relevant  to  students'  educa- 
tional goals 

_27*  has  a well  developed  plan  for  each  class  session 

28.  gives  clear  explanations 

29.  is  impartial  in  grading  students 

_30.  makes  students  feel  at  ease  when  talking  with  him 
J31.  post  office  hours 

_32.  answers  thoroughly  all  questions  which  arise  during  the  lecture 
_33-  uses  a variety  of  teaching  techniques 
_3^.  seldom  indicates  when  a new  topic  is  being  introduced 
35*  is  harshly  critical  of  student  responses 
_36.  uses  understandable  vocabulary 
_37-  hi 3 lectures  follow  a logical  sequence 
38.  provides  assistance  outside  of  class 
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_39-  has  favorite  students  with  whom  he  spends  more  time  than  others 

_40.  lectures  in  such  a way  that  the  textbook  is  easier  to  under- 
stand 

bl,  presents  the  origins  of  ideas  and/or  concepts 
Jt2.  his  lectures  make  note-taking  easy 
J+3 . presents  examples  of  what  he  wants  students  to  dc 
_44.  class  presentations  are  well  prepared 
J*5«  ha3  no  interest  in  the  quality  of  hi3  teaching 
_k6.  does  not  utilize  the  blackboard  very  well 
bj.  is  tolerant  of  students'  mistakes 
bQ.  presents  an  outline  of  the  course 

J*9«  has  only  a superficial  knowledge  of  the  subject  matter 

_50.  discusses  examination  questions  and  answers  before  the  exams 

_51.  tells  students  what  has  to  be  done  to  get  a good  grade  in 
the  course 

_52.  is  not  responsive  to  the  students'  comments  and  questions  in 
class 

_53-  does  not  encourage  silent  students  to  participate  1a  class 

_5b.  gives  exam  questions  that  are  very  picky 

_55«  uses  examples  which  aid  learning  of  the  course  material 

_5 6.  tries  to  force  his  own  ideas  on  the  student 

_57.  summarizes  the  material  in  a way  that  aids  retention 

_58-  fails  to  differentiate  between  significant  and  insignif leant 
materials 

_59-  shows  favoratism  (ha3  favorites)  when  he  is  grading 
_60.  frequently  subjects  3tudent3  to  riducule  and  sarcasm 
_6l.  Is  frequently  distracted  from  the  main  point  while  lecturing 
_62 . finds  ways  to  help  students  answer  their  own  questions 
_63.  discusses  the  activities  persona  in  his  profession  engage  in 
6b. 


explains  the  grading  system 
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65 • Is  punctual 

66.  attempts  to  cover  too  much  aiaterial 

67.  adjusts  his  pace  to  the  needs  of  the  class 

68.  does  not  weight  tests,  quizes  and  reports  fairly  in  determin- 
ing grades 

69.  invites  students  to  share  their  knowledge  and  experiences 
TO.  t^J  level  of  the  lectures  is  too  difficult 

Tl.  does  not  organize  course  objectives 

J2.  asks  confusing  questions  on  his  tests 

73.  introduces  stimulating  ideas  about  the  subject 

Jk.  admits  when  he  has  made  a mistake 

75 • grades  testa  too  strictly 

76.  seens  to  lack  self  confidence 

_7T.  does  not  return  tests  within  a reasonable  time. 

j8.  treats  all  students  equally 

79.  does  not  grade  hl3  tests  fairly 

_8o.  covers  the  material  in  the  course  too  quickly 

_8l.  presents  recent  developments  in  the  field 

82.  ends  class  on  time 

_83.  rephrases  statements  the  students  do  not  understand 
_8h . asks  silent  students  to  participate 

85.  knows  the  subject  matter 

86.  does  not  offer  students  freedom  to  express  their  opinions  in 
class 

Qj.  admits  when  he  does  not  know  the  answer  to  a question 

_88.  does  not  show  tolerance  for  different  points  of  view 

89.  states  when  a new  topic  i3  being  introduced 

J?0.  changes  his  approach  to  meet  nev  situations 

j?  1.  never  discusses  test  questions  after  the  tests  have  been 
returned 
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_92 • presents  up-to-date  information 

__93.  sllovs  students  to  express  their  problems  related  to  the  course 
_94.  varies  the  speed  and  tone  of  his  voice  when  lecturing 
_95-  wastes  a lot  of  time  in  class 

_j?6.  uses  students  questions  as  an  opportunity  for  class  discussion 

_97 • invites  criticism  of  the  ideas  he  presents 

_J?8.  tends  to  discourage  questions  in  class 

_99-  spends  too  much  time  in  class  dwelling  on  the  obvious 

100.  gives  exams  which  do  not  have  a balanced  coverage  of  the 
major  topics 

101.  lectures  in  a rambling  fashion 

1C2.  does  not  discuss  exam  questions  after  exams  have  been  returned 

103-  is  usually  disrespectful  towards  students 

104.  introduces  topics  by  telling  how  they  relate  to  topics  that 
have  already  been  discussed 

105-  provides  informative  appraisals  of  students ’ work 

106.  does  not  grade  tests  fairly 

107  • talks  to  the  blackboard  when  lecturing 

108.  is  not  receptive  to  students'  viewpoints 

109.  tries  to  force  his  own  ideas  on  the  students 

110.  is  harshly  critical  of  student  responses 

111.  seems  to  lack  self  confidence 

112.  has  only  superficial  knowledge  of  the  subject  matter 

113.  gives  exams  which  do  not  have  a balanced  coverage  of  the  major 
topics 

.114.  answers  questions  directly  and  to  the  point 

.115-  the  way  he  covers  the  material  in  class  makes  it  boring 
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A Very  Good  Instructor 
He  give 8 clear  explanations. 

Os  rephrases  statements  the  students  do  not  understand. 

Hie  provides  assistance  outside  of  class. 

He  synthesizes  or  integrates  the  material  effectively. 

He  can  tell  if  the  class  is  understanding  him  or  not. 

He  gives  class  presentations  which  are  well  prepared. 

He  invites  students  to  make  suggestions  about  the  course. 
He  invites  students  to  ask  questions. 

He  is  punctual. 

He  presents  recent  developments  in  the  field. 


h - 


68 


A Very  Good  Instructor 
He  knows  the  subject  matter. 

He  presents  up-to-date  Information. 

Hie  uses  examples  which  aid  learning  of  the  course  material. 
He  presents  examples  of  what  he  wants  students  to  do. 

He  is  tolerant  of  students'  mistakes. 

He  treats  all  students  equally. 

He  presents  an  outline  of  the  course. 

His  lectures  make  note -taking  easy. 

His  lectures  follow  a logical  sequence. 

Hie  is  enthusiastic. 


L. 
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A Very  Good  Instructor 
He  explains  the  grading  system. 

Hie  allows  students  to  express  their  problems  related  to  the  course. 

He  has  a well  developed  plan  for  each  class  session. 

Hie  gives  concise  explanations. 

He  adjusts  his  pace  to  the  needs  of  the  class. 

Hie  ends  class  on  time . 

He  tells  the  staienta  which  parts  of  the  material  are  most  important. 
He  post 3 office  hours. 

He  explains  how  the  topics  in  the  course  are  related  to  each  other. 

He  makes  students  feel  at  ease  when  talking  with  him. 
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An  Average  Instructor 

He  tells  students  what  has  to  be  done  to  get  a good  grade  In  the  course. 

He  gives  students  written  comments  about  their  performance. 

He  does  not  utilise  the  blackboard  very  well. 

He  does  not  give  enough  exams. 

He  is  frequently  distracted  *r<~m  the  main  point  while  lecturing. 

He  answers  thoroughly  all  questions  which  arise  during  the  lecture. 

He  provides  students  with  the  opportunity  to  apply  newly  acquired  knowledge. 
He  uses  a variety  of  teaching  techniques. 

He  does  not  encourage  silent  students  to  participate  in  class. 

He  tries  to  force  his  own  ideas  on  the  students. 
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An  Average  Instructor 

Hie  expresses  his  appreciation  when  a student  does  a good  Job. 
He  starts  class  on  time. 

He  asks  silent  students  to  participate. 

Hie  attempts  to  cover  too  much  material. 

He  gives  exea  questions  that  are  very  picky. 

Hie  admits  when  he  has  made  a mistake. 

He  uses  understandable  vocabulary. 

The  level  of  his  lectures  is  too  difficult. 

He  gives  too  many  exams. 


He  grades  tests  too  strictly. 


r 
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A Poor  Instructor 

He  covers  the  material  la  the  course  too  quickly. 

He  does  not  show  tolerance  for  different  points  of  view. 

He  does  not  weight  tests,  quizzes  and  reports  fairly  in  determining 
grades. 

He  has  only  superficial  knowledge  of  the  subject  matter. 

He  is  usually  disrespectful  towards  students. 

He  seldom  indicates  when  a new  topic  i3  being  introduced. 

He  is  not  receptive  to  students'  viewpoints. 

The  way  he  covers  the  material  in  class  makes  it  boring. 

He  asks  confusing  questions  on  his  tests. 

He  does  not  grade  his  tests  fairly. 


78 

A Poor  Instructor 

He  does  not  offer  students  freedom  to  express  their  opinions  in  class. 

He  does  not  organise  course  objectives. 

He  is  ill-informed  about  the  topics  In  the  course. 

He  is  usually  incoherent  or  vague  in  what  he  is  saying. 

He  is  harshly  critical  of  student  responses. 

He  fails  to  differentiate  between  significant  and  insignificant  material. 
Students  usually  cannot  see  what  he  writes  on  the  board. 

He  does  not  return  tests  within  a reasonable  time. 

He  has  no  interest  in  the  quality  of  his  teaching. 

He  shows  favoratism  (has  favorites)  when  he  is  grading. 
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A Poor  Instructor 

He  has  favorite  students  with  whoa  he  spends  more  time  than  others. 

Hie  wastes  a lot  of  time  in  class. 

He  gives  tests  which  do  not  give  students  the  opportunity  to  show 
what  they  have  learned. 

He  lectures  in  a rambling  fashion. 

Hie  frequently  subjects  students  to  ridicule  and  sarcasm. 

He  talks  to  the  blackboard  when  lecturing. 

He  gives  exams  which  do  not  have  a balanced  coverage  of  the  major  topics 
He  lectures  in  a low  monotone. 

He  does  not  discuss  exam  questions  after  exams  have  been  returned. 

He  is  not  responsive  to  students'  comments  and  questions  in  class. 
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On  the  basis  of  the  information  given  to  you  about  this  person, 
please  answer  the  following  questions. 

1.  Based  on  the  information  provided,  how  well  do  you  think  this  in- 
structor performs  his  job?  (circle  one) 


1 

very 

poorly 


2 3 

poorly  satisfactorily 


k 

well 


5 

very 

well 


2.  On  the  basis  of  the  information  provided,  would  you  recommend  this 
instructor  fbr  promotion?  (Circle  one) 


definitely 

not 


2 

probably 

not 


uncertain 


probably 


definitely 


3.  If  this  instructor  were  to  explain  to  you  his  philosophy  about 

te^h-ing  how  likely  do  you  think  it  is  that  you  would  agree  with 
bin?  (circle  one) 


1 

would 

definitely 

disagree 


2 

would 

probably 

disagree 


3 

would 
neither 
agree  nor 
disagree 


k 

would 

probably 

agree 


5 

would 

def initely 
agree 


h.  Based  upon  the  information  provided  about  this  person,  how  likely 
do  you  think  it  is  that  you  would  agree  with  his  beliefs  about 
students?  (circle  one) 


1 

would 

definitely 

disagree 


2 

vculd 

probably 

disagree 


3 

would 
neither 
agree  nor 
disagree 


k 

would 

probably 

agree 


5 

would 

definitely 

agree 


5.  If  this  instructor's  performance  changes  in  the  future,  in  which 
direction  do  you  think  the  change  will  occur?  (circle  one) 


1 

his 

performance 
will  decline 
significantly 


2 

his 

performance 
will  decline 
a little 


I do  not 
think  it 
will  change 


4 

his 

performance 

will 

Improve  a 
little 


5 

his 

performance 
will  improve 
significantly 


k. 
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DEBRIEF  HE  QUESTIONNAIRE 


d/4 


Sex:  Male  Female  (circle  one) 


Classification:  Freshman  Sophomore  Junior 

Senior  (check  one) 


1.  As  you  were  actually  rating  the  instructors,  what  thoughts  went  through 
your  mind? 


2.  How  many  of  the  instructors  appeared  to  you  to  be  actually  working? 


3.  To  the  best  of  your  recollection,  how  many  of  the  instructors  were 
smiling?  

h.  How  many  of  the  instructors  wore  glasses?  

5.  About  how  long  (on  the  average ) did  it  take  you  to  fill  out  a 
rating  form  on  a given  instructor?  (check  one) 

less  than  a minute  

about  two  minutes  

about  three  minutes  

about  five  minutes  

about  eight  minutes  

6.  How  many  of  the  instructors  were  sitting  at  a desk?  

7.  What  do  you  think  is  the  most  Important  thing  for  a college  instructor 
to  accomplish  when  teaching  a course? 


8.  How  many  of  the  Instructors  vould  you  guess  could,  be  classified  as 
members  of  a minority  group?  


9.  How  many  of  the  instructors  would  you  say  were  neatly  dressed? 


10.  If  you  could  have  had  miditional  information  about  each  of  the 
instructors,  what  information  would  you  have  wanted  most? 


11.  Did  you  find  some  of  the  instructors  easier  to  rate  than  others, 
if  so  why? 

yes  because 


no  because 
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